Keyboard for inputting Chinese language
Abstract
1.1 Technique of inputting Chinese characters

Because the structure of Chinese characters is very different from the relatively simple alphabetic systems of Western languages, it is very difficult to input Chinese characters into a computer quickly and conveniently. Several systems exist: those based on the "PinYin" (phonetic) system, combinations of the PinYin system with character form techniques, whole character encoding, stroke input encoding, and stroke form encoding.

The PinYin (phonetic) system

The PinYin encoding system has several advantages over other systems. First, it is already part of the curriculum of China's primary schools, so many people are familiar with it. Second, PinYin uses English letters as its encoding characters, so Chinese characters can be encoded directly on a standard English computer keyboard. However, since the output of the PinYin encoding system consists of Chinese characters, and not the PinYin forms of those characters, there are too many degenerate cases, in which one PinYin code matches many characters. Generally some software processing is used to compensate for this effect. For a language such as Chinese, with its many dialects and slangs, there are simply too many Chinese characters for the user to remember or pronounce correctly.

The whole character system

The whole character keyboard is based on a special hardware keyboard, not the standard English language keyboard. Its input speed is comparatively low, and it is not suitable for wide use.

Stroke input encoding system

The stroke input encoding system is easy to learn, but it presents too many degenerate cases and is not suitable for entering large amounts of data.

Stroke form encoding system

The stroke form encoding system is based on a process of first determining the radicals of the characters, then encoding them according to an established set of rules. The stroke form systems developed to date, however, have suffered from the lack of a suitably developed theoretical structure. As a result, such systems still suffer from a large number of synonym codes and slow input speed, and they are difficult to use.

Chinese character strokes and symbols are so varied and so complicated that they can be stored and grouped in a wide variety of ways. The conventional keyboard does not support the pictorial characters of the Chinese language. There are 3000 to 6000 commonly used pictorial Chinese characters (Hanzi). There are too many dialects and slangs in Chinese which …
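The degenerate cases described for the PinYin system can be illustrated with a small sketch. It assumes a toy lookup table from toneless PinYin syllables to candidate characters; the table, its ordering, and the function names `lookup` and `choose` are hypothetical and do not come from the paper. A real input method would use a full dictionary and more elaborate software processing.

```python
# Toy table: toneless PinYin syllable -> characters that share that reading.
# The entries and their order are illustrative assumptions, not frequency data.
CANDIDATES = {
    "ma": ["吗", "妈", "马", "麻", "骂"],
    "shi": ["是", "时", "事", "十", "市"],
}


def lookup(syllable):
    """Return every candidate character for a toneless PinYin syllable."""
    return list(CANDIDATES.get(syllable.lower(), []))


def choose(syllable, index):
    """Resolve the ambiguity by position, as a user would with a number key."""
    options = lookup(syllable)
    if not 0 <= index < len(options):
        raise IndexError(f"no candidate {index} for {syllable!r}")
    return options[index]


if __name__ == "__main__":
    # One typed code yields several characters, so a second selection step is needed.
    print(lookup("ma"))
    print(choose("shi", 0))
```

Each typed syllable returns several characters, so the user (or the software) must make a second choice before a character is committed, which is the extra step the abstract refers to.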
Similar resources
Keyboards for inputting Japanese language-arxiv
The most commonly used Japanese alphabets are Kanji, Hiragana and Katakana. The Kanji alphabet includes pictographs or ideographic characters that were adopted from the Chinese alphabet. Hiragana and Katakana are phonetic alphabets that do not include any characters common to each other or to Kanji. Hiragana is used to spell words of Japanese origin, while Katakana is used to spell words of wes...
A Geometric-based Sketch Recognition Approach for Handwritten Mandarin Phonetic Symbols I
Inputting written Chinese, unlike written English, is a non-trivial operation using a standard keyboard. To accommodate this operation, numerous existing phonetic systems using the Roman alphabet were adopted as a means of input while still making use of a Western keyboard. With the growing prevalence of computing devices capable of pen-based input, naturally sketching written Chinese using a p...
A Chinese Characters Coding Scheme For Computer Input And Internal Storage
A coding scheme for inputting Chinese characters by means of a conventional keyboard has been developed. The code for each Chinese character is composed of two strings of keys, one corresponding to the spelling and the other to the ideographic property of the character. Each code requires no more than seven keys (average five and a half keys) and 99.5% of the ten thousand characters in a dictionary ...
Mule: MULtilingual Enhancement to GNU Emacs
Mule: MULtilingual Enhancement to GNU Emacs handles not only ASCII characters (7 bits) and ISO Latin-1 (8 bits), but also Greek, Cyrillic, Hebrew, Thai, Vietnamese (8 bits), Japanese, Chinese, Korean (16 bits) coded in the ISO2022 standard and its variants (e.g. EUC, Compound Text). A text buffer of Mule holds a mixture of these languages. As for inputting, Mule provides various input methods ...
Intelligent Key Prediction by N-grams and Error-correction Rules
In this paper, we propose an intelligent aid for text input method in order to provide an easier way for text inputting with the conventional keyboard. We use the character n-gram model and error correction rules to identify the language being typed and to predict the most probable character string without extra keystroke. The character n-gram model is also required only a small amount of memor...
Journal title:
- CoRR
Volume abs/1310.3111
Publication date: 2013